75 research outputs found
Inferring Strategies for Sentence Ordering in Multidocument News Summarization
The problem of organizing information for multidocument summarization so that
the generated summary is coherent has received relatively little attention.
While sentence ordering for single document summarization can be determined
from the ordering of sentences in the input article, this is not the case for
multidocument summarization where summary sentences may be drawn from different
input articles. In this paper, we propose a methodology for studying the
properties of ordering information in the news genre and describe experiments
done on a corpus of multiple acceptable orderings we developed for the task.
Based on these experiments, we implemented a strategy for ordering information
that combines constraints from chronological order of events and topical
relatedness. Evaluation of our augmented algorithm shows a significant
improvement of the ordering over two baseline strategies
Reduced model for female endocrine dynamics: Validation and functional variations
A normally functioning menstrual cycle requires significant crosstalk between
hormones originating in ovarian and brain tissues. Reproductive hormone
dysregulation may cause abnormal function and sometimes infertility. The
inherent complexity in this endocrine system is a challenge to identifying
mechanisms of cycle disruption, particularly given the large number of unknown
parameters in existing mathematical models. We develop a new endocrine model to
limit model complexity and use simulated distributions of unknown parameters
for model analysis. By employing a comprehensive model evaluation, we identify
a collection of mechanisms that differentiate normal and abnormal phenotypes.
We also discover an intermediate phenotype--displaying relatively normal
hormone levels and cycle dynamics--that is grouped statistically with the
irregular phenotype. Results provide insight into how clinical symptoms
associated with ovulatory disruption may not be detected through hormone
measurements alone
Making effective use of healthcare data using data-to-text technology
Healthcare organizations are in a continuous effort to improve health
outcomes, reduce costs and enhance patient experience of care. Data is
essential to measure and help achieving these improvements in healthcare
delivery. Consequently, a data influx from various clinical, financial and
operational sources is now overtaking healthcare organizations and their
patients. The effective use of this data, however, is a major challenge.
Clearly, text is an important medium to make data accessible. Financial reports
are produced to assess healthcare organizations on some key performance
indicators to steer their healthcare delivery. Similarly, at a clinical level,
data on patient status is conveyed by means of textual descriptions to
facilitate patient review, shift handover and care transitions. Likewise,
patients are informed about data on their health status and treatments via
text, in the form of reports or via ehealth platforms by their doctors.
Unfortunately, such text is the outcome of a highly labour-intensive process if
it is done by healthcare professionals. It is also prone to incompleteness,
subjectivity and hard to scale up to different domains, wider audiences and
varying communication purposes. Data-to-text is a recent breakthrough
technology in artificial intelligence which automatically generates natural
language in the form of text or speech from data. This chapter provides a
survey of data-to-text technology, with a focus on how it can be deployed in a
healthcare setting. It will (1) give an up-to-date synthesis of data-to-text
approaches, (2) give a categorized overview of use cases in healthcare, (3)
seek to make a strong case for evaluating and implementing data-to-text in a
healthcare setting, and (4) highlight recent research challenges.Comment: 27 pages, 2 figures, book chapte
Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2
Background: The ShARe/CLEF eHealth challenge lab aims to stimulate development of natural language
processing and information retrieval technologies to aid patients in understanding their clinical reports. In clinical
text, acronyms and abbreviations, also referenced as short forms, can be difficult for patients to understand. For one
of three shared tasks in 2013 (Task 2), we generated a reference standard of clinical short forms normalized to the
Unified Medical Language System. This reference standard can be used to improve patient understanding by
linking to web sources with lay descriptions of annotated short forms or by substituting short forms with a more
simplified, lay term.
Methods: In this study, we evaluate 1) accuracy of participating systems’ normalizing short forms compared to a
majority sense baseline approach, 2) performance of participants’ systems for short forms with variable majority
sense distributions, and 3) report the accuracy of participating systems’ normalizing shared normalized concepts
between the test set and the Consumer Health Vocabulary, a vocabulary of lay medical terms.
Results: The best systems submitted by the five participating teams performed with accuracies ranging from 43 to
72 %. A majority sense baseline approach achieved the second best performance. The performance of participating
systems for normalizing short forms with two or more senses with low ambiguity (majority sense greater than
80 %) ranged from 52 to 78 % accuracy, with two or more senses with moderate ambiguity (majority sense
between 50 and 80 %) ranged from 23 to 57 % accuracy, and with two or more senses with high ambiguity
(majority sense less than 50 %) ranged from 2 to 45 % accuracy. With respect to the ShARe test set, 69 % of short
form annotations contained common concept unique identifiers with the Consumer Health Vocabulary. For these
2594 possible annotations, the performance of participating systems ranged from 50 to 75 % accuracy.
Conclusion: Short form normalization continues to be a challenging problem. Short form normalization systems
perform with moderate to reasonable accuracies. The Consumer Health Vocabulary could enrich its knowledge base
with missed concept unique identifiers from the ShARe test set to further support patient understanding of
unfamiliar medical terms.</p
Self-reported use of complementary and alternative medicine (CAM) products in topical treatment of diabetic foot disorders by diabetic patients in Jeddah, Western Saudi Arabia
<p>Abstract</p> <p>Background</p> <p>There is little published on current Saudi diabetic patients' practices when they are exposed to foot disorders such as open wound, ulcer, and skin cracks. These factors are usually influenced by local culture and communities beliefs. The aim of the current study was to identify the pattern of patients' use of CAM products in dealing with diabetic foot disorders topically in a group of diabetic patients.</p> <p>Findings</p> <p>A Cross-sectional descriptive study of a representative cohort of diabetic patients living in Jeddah, Saudi Arabia was designed. A pre-designed questionnaire to identify local diabetics' practices in dealing topically with foot disorders including open wound, chronic ulcer, and skin cracks was designed. Questionnaire was administered by a group of trained nutrition female students to diabetics face to face living in their neighborhood. A total of 1634 Saudi diabetics were interviewed. Foot disorders occurred in approximately two thirds of the respondents 1006 (61.6%). Out of the 1006 patients who had foot disorders, 653 reported trying some sort of treatment as 307 patients (47.1%) used conventional topical medical treatment alone, 142 (21.7%) used CAM products alone, and 204 (31.2%) used both treatments. The most commonly used CAM product by the patients was Honey (56.6%) followed by Commiphora Molmol (Myrrh) in (37.4%) and Nigellia Sativa (Black seed) in (35.1%). The least to be used was Lawsonia inermis (Henna) in (12.1%). Ten common natural preparations used topically to treat diabetic foot disorders were also identified.</p> <p>Conclusions</p> <p>The use of CAM products in topical treatment of diabetic foot disorders is fairly common among Saudi diabetic patients. Honey headed the list as a solo topical preparation or in combination with other herbs namely black seeds and myrrh. The efficacy of the most common products needs further research.</p
The structural and content aspects of abstracts versus bodies of full text journal articles are different
<p>Abstract</p> <p>Background</p> <p>An increase in work on the full text of journal articles and the growth of PubMedCentral have the opportunity to create a major paradigm shift in how biomedical text mining is done. However, until now there has been no comprehensive characterization of how the bodies of full text journal articles differ from the abstracts that until now have been the subject of most biomedical text mining research.</p> <p>Results</p> <p>We examined the structural and linguistic aspects of abstracts and bodies of full text articles, the performance of text mining tools on both, and the distribution of a variety of semantic classes of named entities between them. We found marked structural differences, with longer sentences in the article bodies and much heavier use of parenthesized material in the bodies than in the abstracts. We found content differences with respect to linguistic features. Three out of four of the linguistic features that we examined were statistically significantly differently distributed between the two genres. We also found content differences with respect to the distribution of semantic features. There were significantly different densities per thousand words for three out of four semantic classes, and clear differences in the extent to which they appeared in the two genres. With respect to the performance of text mining tools, we found that a mutation finder performed equally well in both genres, but that a wide variety of gene mention systems performed much worse on article bodies than they did on abstracts. POS tagging was also more accurate in abstracts than in article bodies.</p> <p>Conclusions</p> <p>Aspects of structure and content differ markedly between article abstracts and article bodies. A number of these differences may pose problems as the text mining field moves more into the area of processing full-text articles. However, these differences also present a number of opportunities for the extraction of data types, particularly that found in parenthesized text, that is present in article bodies but not in article abstracts.</p
Enveloping Sophisticated Tools into Process-Centered Environments
We present a tool integration strategy based on enveloping pre-existing tools without source code modifications or recompilation, and without assuming an extension language, application programming interface, or any other special capabilities on the part of the tool. This Black Box enveloping (or wrapping) idea has existed for a long time, but was previously restricted to relatively simple tools. We describe the design and implementation of, and experimentation with, a new Black Box enveloping facility intended for sophisticated tools --- with particular concern for the emerging class of groupware applications
Antimicrobial resistance among migrants in Europe: a systematic review and meta-analysis
BACKGROUND: Rates of antimicrobial resistance (AMR) are rising globally and there is concern that increased migration is contributing to the burden of antibiotic resistance in Europe. However, the effect of migration on the burden of AMR in Europe has not yet been comprehensively examined. Therefore, we did a systematic review and meta-analysis to identify and synthesise data for AMR carriage or infection in migrants to Europe to examine differences in patterns of AMR across migrant groups and in different settings. METHODS: For this systematic review and meta-analysis, we searched MEDLINE, Embase, PubMed, and Scopus with no language restrictions from Jan 1, 2000, to Jan 18, 2017, for primary data from observational studies reporting antibacterial resistance in common bacterial pathogens among migrants to 21 European Union-15 and European Economic Area countries. To be eligible for inclusion, studies had to report data on carriage or infection with laboratory-confirmed antibiotic-resistant organisms in migrant populations. We extracted data from eligible studies and assessed quality using piloted, standardised forms. We did not examine drug resistance in tuberculosis and excluded articles solely reporting on this parameter. We also excluded articles in which migrant status was determined by ethnicity, country of birth of participants' parents, or was not defined, and articles in which data were not disaggregated by migrant status. Outcomes were carriage of or infection with antibiotic-resistant organisms. We used random-effects models to calculate the pooled prevalence of each outcome. The study protocol is registered with PROSPERO, number CRD42016043681. FINDINGS: We identified 2274 articles, of which 23 observational studies reporting on antibiotic resistance in 2319 migrants were included. The pooled prevalence of any AMR carriage or AMR infection in migrants was 25·4% (95% CI 19·1-31·8; I2 =98%), including meticillin-resistant Staphylococcus aureus (7·8%, 4·8-10·7; I2 =92%) and antibiotic-resistant Gram-negative bacteria (27·2%, 17·6-36·8; I2 =94%). The pooled prevalence of any AMR carriage or infection was higher in refugees and asylum seekers (33·0%, 18·3-47·6; I2 =98%) than in other migrant groups (6·6%, 1·8-11·3; I2 =92%). The pooled prevalence of antibiotic-resistant organisms was slightly higher in high-migrant community settings (33·1%, 11·1-55·1; I2 =96%) than in migrants in hospitals (24·3%, 16·1-32·6; I2 =98%). We did not find evidence of high rates of transmission of AMR from migrant to host populations. INTERPRETATION: Migrants are exposed to conditions favouring the emergence of drug resistance during transit and in host countries in Europe. Increased antibiotic resistance among refugees and asylum seekers and in high-migrant community settings (such as refugee camps and detention facilities) highlights the need for improved living conditions, access to health care, and initiatives to facilitate detection of and appropriate high-quality treatment for antibiotic-resistant infections during transit and in host countries. Protocols for the prevention and control of infection and for antibiotic surveillance need to be integrated in all aspects of health care, which should be accessible for all migrant groups, and should target determinants of AMR before, during, and after migration. FUNDING: UK National Institute for Health Research Imperial Biomedical Research Centre, Imperial College Healthcare Charity, the Wellcome Trust, and UK National Institute for Health Research Health Protection Research Unit in Healthcare-associated Infections and Antimictobial Resistance at Imperial College London
Are decision trees a feasible knowledge representation to guide extraction of critical information from randomized controlled trial reports?
<p>Abstract</p> <p>Background</p> <p>This paper proposes the use of decision trees as the basis for automatically extracting information from published randomized controlled trial (RCT) reports. An exploratory analysis of RCT abstracts is undertaken to investigate the feasibility of using decision trees as a semantic structure. Quality-of-paper measures are also examined.</p> <p>Methods</p> <p>A subset of 455 abstracts (randomly selected from a set of 7620 retrieved from Medline from 1998 – 2006) are examined for the quality of RCT reporting, the identifiability of RCTs from abstracts, and the completeness and complexity of RCT abstracts with respect to key decision tree elements. Abstracts were manually assigned to 6 sub-groups distinguishing whether they were primary RCTs versus other design types. For primary RCT studies, we analyzed and annotated the reporting of intervention comparison, population assignment and outcome values. To measure completeness, the frequencies by which complete intervention, population and outcome information are reported in abstracts were measured. A qualitative examination of the reporting language was conducted.</p> <p>Results</p> <p>Decision tree elements are manually identifiable in the majority of primary RCT abstracts. 73.8% of a random subset was primary studies with a single population assigned to two or more interventions. 68% of these primary RCT abstracts were structured. 63% contained pharmaceutical interventions. 84% reported the total number of study subjects. In a subset of 21 abstracts examined, 71% reported numerical outcome values.</p> <p>Conclusion</p> <p>The manual identifiability of decision tree elements in the abstract suggests that decision trees could be a suitable construct to guide machine summarisation of RCTs. The presence of decision tree elements could also act as an indicator for RCT report quality in terms of completeness and uniformity.</p
- …